Conference Proceedings
Context-Uncertainty-Aware Chatbot Action Selection via Parameterized Auxiliary Reinforcement Learning
Chuandong Yin, Rui Zhang, Jianzhong Qi, Yu Sun, Tenglun Tan, D Phung (ed.), VS Tseng (ed.), GI Webb (ed.), B Ho (ed.), M Ganji (ed.), L Rashidi (ed.)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | SPRINGER INTERNATIONAL PUBLISHING AG | Published : 2018
Abstract
We propose a context-uncertainty-aware chatbot and a reinforcement learning (RL) model to train the chatbot. The proposed model is named Parameterized Auxiliary Asynchronous Advantage Actor Critic (PA4C). We utilize a user simulator to simulate the uncertainty of users’ utterance based on real data. Our PA4C model interacts with simulated users to gradually adapt to different users’ utterance confidence in a conversation context. Compared with naive rule-based approaches, our chatbot trained via the PA4C model avoids hand-crafted action selection and is more robust to user utterance variance. The PA4C model optimizes conventional RL models with action parameterization and auxiliary tasks for..
View full abstractGrants
Awarded by Australian Research Council (ARC)
Funding Acknowledgements
This work is supported by Australian Research Council (ARC) Future Fellowships Project FT120100832 and Discovery Project DP180102050.